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Abstract 

This paper is devoted to a systematic study of a class of binary trees 
encoding the structure of rational numbers both from arithmetic and 
dynamical point of view. The paper is divided into two parts. The 
first one is a critical review of rather standard topics such as Stern- 
Brocot and Farey trees and their connections with continued fraction 
expansion and the question mark function. In the second part we in- 
troduce a class of one-dimensional maps which can be used to generate 
the binary trees in different ways and study their ergodic properties. 
This also leads us to study some random processes (Markov chains and 
martingales) arising in a natural way in this context. 
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1 Part one: arithmetics 



Notational warning: In the sequel we shall use the following notations: 



/ := [0,1] 

J := [0,oo)U{oo} 

Qi := Qn[o,i] 

Qp ■■= lj s : s£N, 0< k<p s Y p>2 



1.1 A class of binary trees 

We start with the Stern-Brocot (SB) tree T, which is a way to order (and 
thus to count) the elements of Q + , the set of positive rational numbers, so 
that every number appears (and thus is counted) exactly once (see [St], |Br] 
and, for a modern account, |GKP] ). The basic operation needed to construct 
T is the Farey sum: given | and |? in Q + set 

p p' _ p + p' 

~ ^ ~7 ~ i 7 
q q q + q 

One notes that the child 1 ^ turns out to be in lowest terms whenever the 
parents | and ^ do. Moreover, the child always lies somewhere in between 
its parents, e.g., assuming | < ^, we have | < ^±2_ < 

Starting from the ancestors and oo (written 'in lowest terms' ) one then 
writes genealogically one generation after the other using the above opera- 
tion: 



I 

1 o 

l 
i 

1 2 

2 1 

12 3 3 

3 3 2 1 

123345 54 
455433 21 



1233455457877875 
5787787545543321 
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and so on. The easily verified property which makes the above interesting 
and useful is the following fact: if | and K are consecutive fractions at any 
stage of the construction then the unimodular relation qp' — pq' = 1 is in 
force. 

Finally, the subtree J- of T having ^ as root node and vertex set Qi is 
called Farey tree. It can be obtained exactly in the same way as T taking 
as ancestors j and j instead of j and q. 

Lemma 1.1. Let <j> : J — » / be the invertible map defined by (f>(oo) = 1 and 

6{x) = — ^— , x G M + 
y ' x + 1 

Then 

<KT) = F 

Proof It suffice to notice that <ft(j) = j, <j>{ h ) = j, <ft(j) = \ and for 
x, x' £ Q+ we have <j){x) <f>(x') = <j>{x ®x'). □ 

Another structure we shall deal with is the dyadic tree T>, whose first two 
levels are as in T and then can be constructed from the root node \ by 
writing under each vertex | the pair 2 ^~ x and . The vertex set of T> is 
Q2. We shall see later how it is related to T and T . 

1.2 Continued fractions and the {L, R} coding 

Every x € Q + appears exactly once in the above construction and corre- 
sponds to a unique finite path on T starting at the root node j and whose 
number of vertices equals the depth of x, i.e. the level of T it belongs to. 
For x £ Qi one may just consider the path on the subtree J- which starts at 
the root node \ and whose number of vertices is the rank of x. For x £ Q + 
we have 

depth(j;) = [x] + rank({x}) + 1 (1.1) 

In order to properly code these paths we start recalling that every rational 
number x £ Q + has a unique finite continued fraction expansion [Kh 

x = a H = [a Q ; at, . . . , a n ] 

ai H 

1 

+— 

a n 

with ao > 0, ai > 1 for 1 < i < n and a n > 1. 
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Lemma 1.2. Let x G Q + i/ien 

n 

x = [o ; 01, . . . ,a n ] depth(x) = ^ a, 

i=0 

Proof. Setting depth(°) = depth(g) = we have depth(^) = 1. Let now 
x = [ao; ai, . . . , On] be such that depth(x) = d > 1. Then, in order to reach 
the leaf x from the root \ one has first to move ao steps to the right, thus 
reaching the node ao + \. Then, moving a\ steps to the left one reaches 
ao H 7x . d2 further steps to the right reach the point ao H ; — i — and so 

ai + y ' ' aiH ■ t 

a 2+f 

on. In this way, one sees that the path to reach x makes exactly n turns and 
the length of the blocks within the (i — l)-st and the i-th turn is given by the 
partial quotient a» for 1 < i < n, whereas the last block has length a n — 1. 
More precisely, the blocks moving to the left are related to partial quotients 
with odd index, those moving to the right to those with even index. It then 
follows at once that d = Y17=o a «- '— ' 

The argument sketched above actually allows us to say more. To this end, 
we shall first construct a matrix representation of the positive rationals. We 
start noting that a given x £ Q + can be uniquely decomposed as 

P m P' ' '1 

x = - © — with qp — pq =1 

Q q 

The neighbours | and ^ are thus the parents of x as an element of T. We 
then identify 



p' p 



£ 5L(2,Z) 



q q 

Note that the left column bears on the right parent and viceversa. In this 
way, the root node yields the identity matrix: 

1 _ 1 f 1 

i"i®o^ Vo 1 

Moreover, given M 6 SX(2,Z) which represents the fraction x € Q + , the 
matrix U MU represents the symmetric fraction 1/x, with 



U = u- 1 



1 

1 



In particular 
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t S J \ 1 1 J \ S + t S J S S + t 

n m \ / 1 1 \ / n m + n \ m + n n 



More generally, for G N, 

Now, the point x considered above has in turn a unique pair of (left and 
right) children, given by 

m m + n m + n n 

- — — and — — e - 
s s + 1 s + 1 t 

respectively. Moreover, 

n m \ ( 1 \ / m + n m \ m m + n 



and 

( t s J \ 1 J \ t s + t J ~ ' s + t ^ t 

In other words, the matrices L and R, when acting from the right, move to 
the left and right child in T, respectively. Together with the argument of 
the proof given above this yields the following result. 

Proposition 1.3. To each entry x £ T there corresponds a unique element 
X G SL(2,Z), for which we have the following two possibilities: 

• x = [a ;ai,...,a n ], n even =>- X = R a °L ai ■ ■ ■ L "- 1 /? " -1 

• x = [a ;ai,...,a n ], n odd => X = R a °L ai R a ' 2 ■ ■ ■ L an ~ l 
As an easy consequence we have the 

Corollary 1.4. Let x = [ao; a±, . . . , a n ] with a n > 1 and n even. Then 
its left and right children in T are given by x' = [ao; a±, . . . , a n — 1, 2] and 
x" = [ao; ai, . . . , a n + 1], respectively. If instead n is odd the expansions for 
x' and x" are interchanged. 

Proof. For n even and larger than one we have x = [ao; ai, . . . , a n ] <^=^ X = 
R at >L ai R a2 ■■■R a "- 1 . Therefore x' X' = R at >L ai R a2 ■■■R a ^ 1 L and 
x" <^=^ X" = R a °L ai R a2 ■ ■ ■ R a " which yield the claim. A similar reasoning 
applies for n = and n odd. □ 
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1.3 The infinite coding 

One can extend the above construction by associating to each x G IR + a 
unique infinite path in T, or else a unique semi-infinite word in %{x) G 
{L, in the natural way. First, to x G M + \ Q + with infinite continued 
fraction expansion x = [ao; ai, 02, 03, . . . ] there will correspond the (unique) 
sequence ir(x) = R a ° L ai R a2 L a3 . . . , where now R and L are nothing but 
elements of a binary alphabet. For rational x we can proceed as follows. 
First we set ir(j) = L°° and 7r(g) = Then note that each x G Q + has 
two infinite paths which agree down to node x: they are those starting with 
the finite sequence coding the path to reach x from the root node according 



to Proposition 1.3 and terminating with either RL°° or LR°°. We shall agree 
that 7r(x) terminates with RL°° or LR°° according whether the number of 
its partial quotients of x is even or odd. Summarizing we have the following 
coding 

• x = [a ; ax, . . .,a n ], n even =^ ir(x) = R a °L ai ■ ■ ■ R an L°° 

• x = [a Q ;ai,...,a n ], n odd => vr(x) = R a ° L ai ■ ■ ■ L a - R°° 

• x = [00501,02,03,...] => tt(x) = R a ° L ai R a2 L a3 .... 

One easily checks that if >- denotes the lexicographic order on {L, then 

x > y =^ 7r(x) y ir(y) 

Finally, from the above it follows that for an irrational x the infinite path 
on T converging to x coincides with the slow continued fraction algorithm 
(see, e.g., |S3). 

1.4 The (extended) question mark function 

Given a number x G M + with continued fraction expansion x = [ao; 01, a%, . . . ] 
one may ask what is the number obtained by interpreting the sequence tt{x) 



defined in Section 1.3 as the binary expansion of a real number in (0,1). 



The number so obtained, denoted p(x), writes 

p(x) = .11^^00^^ U^_l ■■■ (1.2) 

ao ai 0,2 

or, which is the same, 

p(x) = 1 _ J^(-l) fc 2-( a o+- +a *) (1.3) 
fc>0 
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For instance p(l/n) = 1/2™ and p{n) = 1 — 1/2™ for all n > 1. Setting 
p(0) = and p(oo) = 1 we see that p : R + — > I satisfies 

^(x) =? o (x) (1.4) 

where (ft : J — > / is the map defined in Lemma and ? : I — * I is the 
Minkowski question mark function [M] . which for x = [0; ai, 02, . . .] is given 
by 

?(x) = 0.00...011...1 00...0 ••• (1.5) 

01 — 1 02 <13 

Differently said, for x G (0, 1) the number ?(x) is obtained by interpreting 
the symbolic sequence corresponding to the path which starts from the root 
node 2 and approaches x along the Farey tree T as a binary expansion of a 
real number in (0, 1). 

We now need a simple lemma. 
Lemma 1.5. 

{[0; 1 + a 2 ,a 3 , . . .] if a% = 1 
[0; l,ai - l,a 2 , . . .] if ai > 1 

Proof. Upon application of the identity 

11 ,11 

r + i = 1 and =- + = 1 □ 

Proposition 1.6. The functions ? and p satisfy the functional equations 

?(x)+?(l - x) = 1 , xel (1.6) 

and 

p(x) +p(l/x) = 1 , xGJ (1.7) 



Proof. The equation for ? follows at once from Lemma 1.5 and (1.5). That 
for p then follows by (1.4). □ 



Additional properties of p are inherited via (1.4) from the following proper- 
ties of ? (see [SI], jE], jVPB] . (Vj): 

• ?(x) is strictly increasing from to 1 and Holder continuous of order 

a - log 2 
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• x € Qi iff ?0) G Q 2 

• x is a quadratic irrational iff ?(x) is a (non-dyadic) rational 

• ?(x) is a singular function: its derivative vanishes Lebesgue-almost 
everywhere 



1 X 

l [ 



0.8 



0.6 



0.4 



0.2 



0.2 0.4 0.6 0.8 

Figure 1: The ? function 



For any pair - and ^ of consecutive fractions in T the function p (as ? on 
J-) equates their child to the arithmetic average. For instance we have 

'p + p' 



P 



q + q' 



'MM'* 



(1.8) 



Therefore the functions p and ? map the SB tree T and the Farey tree 
respectively, to the dyadic tree T> mentioned above. Note that the set T>k 
of dyadic fractions belonging to the first k + 1 levels of T> is the uniformly 
spaced sequence l/2 k , I = 0, 1, . . . , 2 k . Reducing to the lowest terms we get 



V 









?■ 











1 1 

T'2'T 



v 2 



113 1 

T'4'2'4' 1 



and so on. Hence, an immediate consequence of the fact that p{T) =?(J 7 ) = 
D is that and ?(x) are the asymptotic distribution functions of the 
sequences of SB fractions and Farey fractions, respectively. 
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Theorem 1.7. Set % := {£ G T : depth(x) < fc}. Since 



£ £ 

2 fc 



-fc 



we have 



p(x) - 



2 k 



< 2 



< 2" 



TTie same ZioWs /or ? mi/t 7^ replaced by Tk '■= {? £ J 7 : rank(x) < A;}. 

In particular, the Fourier-Stieltjes coefficients of p and ? are as in the fol- 
lowing 



Corollary 1.8. Let 



then 



z n := \ e 2ninx dp{ 
Jo 



2 7r i n 2 
e 9 



T7ie same ZioWs /or i/ie coefficients of? with 7^ replaced by T^. 



1.5 Permuted trees 



Let X G SL (2,7.) represent a number x G Q + as above and write it as 
X = Yli=i Mi where Mi G {L,R} and k = depth(x). Let us denote by 
x the positive rational number represented by the reversed matrix product 
X = rii=fc-^- Clearly depth(x) = depth(x) but x = x if and only if the 
sequence M\ . . . is a palindrome. The permutation map x i— ► x yields the 
permuted version T of the SB tree whose first five levels are shown below 
(the ancestors [j and ^ are omitted). 

l 
l 



1 2 

2 1 

13 2 3 

3 2 3 1 

143525 34 
435253 41 



1547385727583745 
5473857275837451 
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Lemma 1.9. Under the ancestors j and ^, the permuted SB tree T can be 
constructed starting from the root node j and writing under each vertex | 

the set of descendants {^j^, 
Proof. Note that 

1 \ / n m \ ( n m \ m + n 



1 1 J \ t s J V n + s m + t J m + n + s + t 



1 1 \ f n m \ / n + s m + t \ m + n + s + t 



and 



1 J \ t s J V s t J s + t 

In other words, the matrices L and R, when acting from the left give but 
the left and right descendants, respectively. 

Also note that if | = [ao; a±, . . . , a n ] then = [0; 1, ap, ai, . . . , a n ] and 
^ = [ao + 1; ai, . . . , a n ]. Therefore 

depthf^— ) = depth(^ti) = depth(^) + 1 
p + q q q 

This yields the claim. □ 



Remark 1.10. The tree T has been considered in JCW)/ where the authors 
argued that if we read it row by row, and each row from left to right, then for 
i > 2 we can write the i-th element in the form Xi = b{i — 2)/b(i — 1), where 
b{n) is the number of hyperbinary representation of n, namely the number 
of ways of writing the integer n as a sum of powers of two, each power being 
used at most twice. For example 8 = 2 3 = 2 2 + 2 2 = 2 2 + 2 + 2 = 2 2 + 2 + l + l 
and therefore 6(8) = 4. This property plainly entails that when reading from 
left to right any sequence of fractions with fixed depth the denominator of 
each fraction is the numerator of its successor. 



We finally define the corresponding permutation of both the Farey tree T 
and the dyadic tree T>, den oted T and T> respectively. Clearly we have 



J- = (j>(T) (see Lemma 1.1). Reasoning as above one easily obtains the 



following simple genealogical rules: 

Lemma 1.11. Under the ancestors ? and \, the permuted trees T and T> 
can be constructed starting from the root node \ and writing under each 
vertex | the sets of descendants {^j, 2^5} an d ^f}> respectively. 
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The first five levels of T are 



1 

2 

1 2 

3 3 



13 2 3 

4 5 5 4 

143 52 5 34 

578 7 7 8 75 



I5_Aj1_3__8_JlZ2 j!_5__8__3_J145 
6 9 11 10 11 13 12 9 9 12 13 11 10 11 9 6 



and the corresponding levels of V are 



l 

2 



I 3 
4 4 

15 3 7 

8 8 8 8 

_L_9__5_13_3_ 11 7_ 15 

16 16 16 16 16 16 16 16 



J_17J3_25_5_211329_3_ 191127X231531 
32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 32 



1.6 Random walks on the permuted trees 

We now construct a sequence of random variables Z\ , Zi , . . . on Q + defined 
recursively in the following way: set Z\ = \ and if Z k = | then either 

z k+i = pq^ or Z k+ i = both with probability \. The sequence (Z k ) k >i 

can be regarded as a (symmetric) random walk on T. 

Theorem 1.12. The random walk (Z k ) k >i enters any non-empty open in- 
terval (a, b) C M + almost surely. 

Proof. Pick up an irrational number x = [aojai,--.] €. (a,b). Then for 
n large enough we can find a closed subinterval A C (a, b) such that the 
c.f. expansion of each element of A starts as [ao; ai, • ■ ■ , a n , . . .]. To fix the 
ideas and with no loss, let n be odd. Then, according to the above and 
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Proposition 1.3 the path on T starting from the root j and entering A (for 
the first time) will eventually end with the word W = L an ~ x ■ ■ ■ R a2 L ai R a °. 
Hence it has the form UW with prefix U G {L, R}* so that UW does not 
contain subwords equal to W but W itself. We now proceed by induction 
on the length £ of W, calling We the word of length I. If £ = 1 then there is 
exactly one prefix U of lenght k for each k > 1 (e.g. if W\ = L then U = R k 
is the only possible prefix) occurring with probability 2~ k . Summing over 
the prefixes we get £\>i 2~ fc = 1. Theferore the claim is true for £ = 1. Now 
suppose it is true for £ = m. When passing to £ = m+1 either W m +i = W m L 
or W m+ i = W m R, hence we have two families of paths UW m L and UW m R, 
one of which being UW m +i and thus, by the induction hypothesis, having 
probability |. We are now left with all paths starting with the 'bad' ones 
and eventually ending with W m +i. But then we can use the self-similarity 
of the tree and iterate the above construction. Suppose for instance that 
the 'bad' set was UW m R, that is W m+ i = W m L. Then at some point 
we will end up with the alternative UW m RU'W m L and UW m RU'W m R for 
some U' e {L,R}*, and the 'good' set UW m RU'W m L has probability \ ■ \. 
Iteration of this argument yields the probability ^ + + • • ■ = 2 which has 
to be added to the probability \ of the initial 'good' set UW m L. □ 

Remark 1.13. The above result can be easily extended to both J- and T>. 
However, it seems to be peculiar of the particular permutation which defines 
these trees, in particular it is plainly false for the original Stern-Brocot tree 
T (as well as for T and V). We shall see later a further generalisation. 

2 Part two: dynamics 

We shall now be dealing with a class transformations which generate the 
permuted trees T, T and T>, respectively, either one generation after the 
other or in genealogical way, i.e. producing elements with increasing depth. 

2.1 Rank one ergodic transformations with dense orbits of 
rationals 

It was noticed in (NJ that the sequence Xi of elements of T satisfies the 
iteration 

*» = 1 - W + M ' !ao (2J) 
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We are thus led to study the marj^j-R : J —> J given by R(oo) := and 

*w := i- w + i,] ■ " K+ <2 - 2) 

Proposition 2.1. 

1. R is an automorphism o/M + 

2. for any G Q + j/ and on/y if x £ Q + 

3. R counts the set Q + U {0} U {oo} in the following sense: let Xi be the 
sequence obtained by reading T row by row and each row from left to 
right (except the zero-th one), then xi = R % {\)- 

Proof. One easily checks that R is one-to-one and onto, with inverse 

i . , 11 1 , , 

R- 1 (x)=2n + l , <x<- , n>0. (2.3) 

x n + 1 n 

This proves 1. Statement 2 is immediate. Moreover, if x £ N then R(x) = 
l/(x + 1) so that depth(i?(x)) = depth(:r) + 1. If instead x = [oo; ai,...,a n ] 
with n > 1 then R(x) = l/(ao + 1 — {x}) so that depth(J2(x)) = depth(x) 
since depth(l/x) = depth(j;) and rank({x}) = rank(l — {x}). This yields 
the first part of statement 3. To see the second part we start observing that 
if we write x in the form x = {kq + r) jq with k > and < r < q we have 
R(x) = q/{kq + q — r). Now if k = then a; = r/q and = q/(q — r), 

namely x and are left and right descendants of the fraction r{/q — r). If 
instead k > then x is the right descendant of x' = ((k — l)q + r) jq whereas 
R{x) is the left descendant of x" = q/(kq — r), and x" = R(x'). □ 



Remark 2.2. Note that, although the sequence (xj)j>o defined in (2.4) is 
dense in R + , it 'diffuses' only logarithmically. Indeed we have X{ = n for 
i = 2" and therefore sup 0<i<n Xi = O(logn). In fact, from what is proved 
below it follows that all orbits {R l {x) , i > 0}, x € M + , are dense and have 
this property. An automorphism of the unit circle with similar properties 
has been constructed in |Bo . 



We now restrict to the unit interval and consider two automophisms on it. 
The first one is the the map S : I — ► I defined by (see Lemma |1.1[ ) 

S(x) :=(j)oRo(j)- l {x) (2.4) 



1 The study of this map was suggested to one us us (C.B.) by Don Zagier 
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or else by S(l) = and 
S(x) = - 



1 — x 



+ 



1 — X 



x e [o,i) 



(2.5) 



Its inverse is 



2 nx — 1 
(2n+l)x - 1 



1 1 

< x < - 

n + 1 n 



n > 1 



(2.6) 



The second is the classical Von Neumann-Kakutani transformation T : I 
I given by T(l) := and 



T(x) := x + 



2 n+l 



1 1 

1 , 1 -r 

' 2 n — 2 n ~^~ 



n > 1 



(2.7) 



It was defined in |VN\ and is also called van der Corput's transformation or 
else dyadic rotation. 




Theorem 2.3. We have the following commutative diagram 



J 



J 



I 



Proof. The left half follows immediately from (2.4). To see the right half part 
let x £ (0, 1) be given by x = [0; a±, a 2 , • • • ]■ We have = [0; 01 — 1, 02, ■ • .] 
so that 



1 — x 
and therefore 



02 if a± = 1 
if ai > 1 



1 — x 



[0; 03,04,...] if 01 = 1 
[0;oi - l,o 2 ,...] if oi > 1 



5(x) 



(2 + o 2 - [0; 03,04,...]) 1 if at = 1 
(2 - [0;oi - l,a 2 ,...})- 1 if oi > 1 



(2- 



S(x) 



(2.9) 



Using Lemma 1.5 this becomes 

[0; a 2 + 1, 1, 03 - 1, 04, ... ] if ai = 1 

[0; l,l,oi — 2, a 2 ,...] if 01 > 1 

where for > £ > 1 for some k > 1 we set 

[00; ai, 02, . . . , afc_i,afe — £, Ofc+i, • • • ] = [oo; ai, 02, • • • , Ofc-i + Ofc+i, • • • ] 

if afc = I. On the other hand, the map T{x) is also named dyadic rotation 
because of the following fact (see, e.g. |PF] . p. 120): if we expand x £ [0, 1] 
in base two, i.e. we write x = YlT=o UJ k2~ k ~ 1 = O.UJ1UJ2 ■ ■ ■ with uik £ {0, 1}, 
it acts as T(0.111 . . . ) = 0.000 ... and for n > 1 

T(0.U_^A0Lj n+1 Lj n+2 ...) = 0.0^0 luj n+1 uj n+2 ... (2.10) 

n— 1 n—1 



Therefore if x = [0; 01, 02, . . . ] then using ( 1.5 ) we find 

( 0.00...0100...011...1... if oi = 1 



r(7(x)) 



0-2 0,3 — 1 0,4 

0. ioo_^__o 11^1. 

a\—2 a2 



if oi > 1 



(2-11) 
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which is identical to what we obtain applying ? to (2.9). □ 



We now derive some consequences from the above theorem. 

Corollary 2.4. The maps S and T count the sets Qi and Q2, respectively. 
More specifically, let for instance yt be the sequence obtained by reading T 
row by row and each row from left to right (except the zero-th one), then 
yi = S l (j). A similar statement holds for T> and T. 



Proof. Follows at once from Proposition |2.1| and Theorem 2.3 □ 



Corollary 2.5. The systems (J,R) and (I,S) are uniquely ergodic and 
hence (J,R,dp) and (I,S,d?) are ergodic. 

Proof. The first statement follows from the above as topological conjugacy 
preserves unique ergodicity and the system (I, T) has this property. More- 



over, the Lebesgue neasure dx is T-invariant so that by the above and (1.4) 
the maps R and S preserve the measures dp and dl respectively. □ 

Corollary 2.6. The systems (J, R) and (I, S) are of rank one. Moreover 
they have the same spectrum which is discrete with eigenvalues e 27Tia for any 
dyadic rational a. 

Proof. The system (I, T) has this property. Let us briefly recall how this 
is obtained. One start setting A(l,n) = [0,2 _n ) for n > and noticing 
that T maps in an affme way A(i,n) = T l ~ l A(l,n) onto A(i + l,n) for 
i = 1, . . . ,2 n . Clearly, these intervals are not ordered lexicographically but 
in the way induced by T. For example, for n = 3 we have 000 1— ► 100 1— > 
010 1 — ^ 110 1 — ^ 001 1 — ^ 101 1 — ^ 011 1 — ^ 111. One may then write the so 
ordered intervals one above the other, thus making a stack which partitions 
the whole space. The action of T is then that of climbing up one level in 
the n-stack but is not defined on the top level. At step n + 1, i.e. looking 
at the action of the iterates of T on A(l,n + 1), the stack is cutted into 
two equal halves and the right half is stacked on the left half. This defines 
the action of T on a finer partition of the space. This procedure eventually 
leads to the knowledge of T on the whole space. Finally, to get the same 
property for (I, S) it will suffice to follow the above procedure with the 
family of intervals B(i,n) =?~ 1 (A(i,n)) (stacked in same order). Clearly, 
although all the intervals A(i, n), i = 1, . . . , 2 n do have the same length 2~ n , 
the corresponding B(i,n) do not. A similar construction can be done for 
(J,G) with the intervals C(i,n) = 4>~ l (B(i,n)). The last assertion follows 
again from the same property for (J, T) along with topological conjugacy 
(see, e.g., [PF], p.23). □ 
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Finally, setting e n (x) := e 27rinx , the Fourier-Stieltjes coefficients of p are 

POO 

c n := / e n {x)dp{x) (2.12) 

J 



By Corollary 1.8 they can be computed as 



lim —r e n ( 



(2.13) 



On the other hand, the unique ergodicity of (J, R) following from Corollary 



2.5 ensures that they can also be computed as ergodic means in a uniform 
way. 

Corollary 2.7. We have, uniformly for x € J, 

1 

Cn = J™ M Yl e n{R k (x)) 
N^oo iv ' — * 
fe=0 

The same holds for the coefficients of ? with R replaced by S. 

Remark 2.8. An interesting question is whether c n — > as n — > oo (see 
JSaf ). A solution to this question would give some insights into the degree of 
uniformity of the distribution of the dense sequence {R k (x)}k>o, x E J, and 
in particular of the permuted Stern-Brocot sequence obtained setting x = 1. 

2.2 Markov maps and transfer operators 

We now introduce three non-invertible maps which generate the trees T, J- 
and T> genealogically, i.e. via descendants. With the notations of Theorem 



2.3, the first one is the map G : J —> J given by 

x 

G(x 



if < x < 1 



1 — x 
x — 1 if x > 1 



(2.14) 



The second is the modified Farey map F : I — > / given by 

x 

F(x) 



if < x < i 
1 — x z 



(2.15) 



2 - I if \ < x < 1 
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and the third is doubling map D : I — > / given by 

D{x) = 2x (mod 1) 



(2.16) 



They are expansive orientation preserving piecewise analytic endomorphisms 
such that the sets G~ 1 (x), F~ 1 (x) and D~ 1 (x) are composed exactly by two 
points for each x. More specifically 



G-\x) 
F-^x) 



x 



x + 1 



1 + x 

X 1 



1 + x 2-x 



x G J 
D-\x) 



XX 1 

2' 2 + 2 



, x <E I 



Both F and D fix the boundary points and 1 , but for F these are indifferent 
fixed points, i.e. -F'(O) = -F'(l) = 1. More specifically, is a weakly repelling 
fixed point whereas 1 is weakly attracting. On the other hand we can say 
that G has two indifferent fixed points at and oo. 

Theorem 2.9. The permuted tree T can be constructed genealogically from 
its root \ by writing under each leaf x the set of descendants G^ 1 (x). The 
same can be done for T and T> starting from their root \ with the sets of 
descendants F~ 1 (x) and D~ 1 (x), respectively. Furthermore, we have the 
following commutative diagram 



J 



G 



J 



I 



D 



Proof. The first assertion follows from Lemma 1.9, eq. (2.22) and Lemma 
1.11 The proof of the conjugation between G and F is immediate. That 
for F and D can be obtained reasoning along the same lines as in the proof 
starting from the observation that D acts a the shift on 



of Theorem 2.3 



binary expansions whereas the action of F is the Farey shift [0; a\, <22, . . 
[0; a\ — 1, d2, . . . ] on the inte rval [0, |] and [0; 1, a%, . . .] h- > 1 — [0; 02, . . 



on 



(ill- 

□ 



Then use Lemma 



1.5 



We leave the details to the interested reader. 



Remark 2.10. Conversely, using the maps G, F and D one can retrace 
the path from a leaf x in any of the trees T, T or T> back to the root. 
For instance, for x G T let X = Y\i=i Mi be the element which uniquely 
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represents x in SX(2,Z) with k = depth(x), according to Proposition 1.3 



One then sees that the following rule is in force: if G^ l \x) < 1 then 
Mi = L, G^\x) > 1 then Mi = R, for i = 1, k with k = depth(x) so 
that G k (x) = 1. 

Remark 2.11. The map D preserves the Lebesgue measure dx on /, whereas 
the map F preserves the a.c. infinite measure fj,(dx) = dx/x{l — x) = 
{JL log dx on /, as one easily checks. This entails that G preserves 

the (infinite) measure v{dx) = (i o cp(dx) = dx/x on J. 
Note that the entropy of (J, F, dfj) is zero (as well as that of (J, G, dv)). On 
the other hand, from the above theorem it follows that also the measure d? 
is invariant under F (as well as dp for G) and the entropy of (I, F, dl) is 
log 2. Therefore dl is the measure of maximal entropy for (I,F) (as well as 
dp for (J, G)). 

To the map G we associate a generalised transfer operator L q acting on 
/ : J — > C as 

2/eC7 1 (a:) 

or else 

^W-(IT5^(iTs) + «* + 1 ) (2 ' 18) 

where g is a real or complex parameter. We point out that a continuous 
fixed function for L q satisfies the functional equation 



/M^ + iHjTT^f/^J 019) 

which is called Lewis-Zagier three-term functional equation and is related to 
the spectral theory of the hyperbolic laplacian on the modular surface (see 
|LeZa] and references therein). 

In the same way, the operators associated to D and F act on / : I — > C as 
and 



respectively. For the spectral theory of an operator closely related to (2.21) 
see |I| and |BGI| . 
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2.3 Harmonic functions and martingales 

Let $ s , s G {0, 1}, be the inverse branches of G, i.e. 

$ (x) = -?— , 0> 1 (x) = x + l (2.22) 
1 + x 

They satisfy: 

$s(1/x) = ¥^) ' ( 2 - 23 ) 

Let moreover p(s, •), s G {0, 1}, be a pair of positive Borel functions such that 
p(0, x) +p(l, x) = 1, Vi G J. We now want to study the Markov chain with 
state space J where at each step, starting from a state x G J, two transitions 
are possible towards the states $o(^) and ^i(s), with probabilities p(0,x) 
and respectively. Note that for x = \ and p(i,x) = ^, i = 0,1, this 



Markov chain reduces to the random walk on T discussed in Theorem 1.12 



We now briefly adapt to our context some basic facts about canonical 
Markov chains associated to Markov transfer operators (see jCoRa] ; also 
|CoRal| for an application to the dyadic transfer operator (2.20)). Let 
P : L°°(J) — > L°°(J) be the Markov operator acting as 

(Pf)(x) = p(0, x) f (cDo(x)) + p(l, x) f (2.24) 

A measurable function h : J — ► C satisfying P/i = /i is called P-harmonic. 
In the sequel we shall make the further assumption that the transition prob- 
abilities satisfy: 

p{s,l/x) =p(l - s,x) , s£{0,l}, Vx G J (2.25) 

The symmetries ( 2.23| ) and (2.25) yield at once the following 

Lemma 2.12. The averaging operator A : L°°(J) — * L°°(J) acting as 

= fJskUSiM ( 2 . 26 ) 

commutes with P. In particular, if h : J — > C is a bounded P-harmonic 
function then Ah has the same property. 

A positive measure v is called P- invariant if vP = u, i.e. Jj Pf dv = J j f dv 
for all measurable / : J — ► C. In turn, one readily realizes that this condition 
is equivalent to 

^=K«,'). -€{0,1} (2.27) 
dv 
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Now, setting := {0, 1} N , a n-dimensional cylinder of f2 is a subset of the 
type G(ii, . . . , i n ) = {to G f2 : | cji = ii, . . . , u n = i n }. The cylinder sets 
generate the topology of O and its Borel <r-algebra T. 

Given x G J let f7 (a;) be the closure of the set of all possible paths starting 
at x, i.e. 

U{x) = U we n{$ Un o...o$ Ul ( a: ) ) n> 1} (2.28) 

This is clearly a compact invariant set, in the sense that if y G i7(a;) then 
G i G {0, 1}. More generally, a compact subset V of J is called 

invariant if for all x G V and all i G {0, 1} such that p(i,x) > we have 

*i(aO g y. 

A first basic fact (see |CoRaj . Sec. 3.4; or else [Jo], Chap. 2.4) is that for 
each x G J there is a unique probability measure P x on £1 such that 



P x (C7(»i, . . . , i n )) = Yl P(*fc, 



<M*)) 



(2.29) 



fc=i 



The symmetries (2.23) and (2.25) entail the following 



Lemma 2.13. For each x G J we have 

P x (C(ii, . . . , v)) =F 1/x (C(l i n )) 



(2.30) 



For u) G O let Xfc(cj) = be the /c-th coordinate function on f2 and Z n the 
subalgebra of C(O) generated by the first n coordinates {X^, 1 < k < n}. 
The Z n form a filtration in that 2 n C Z n +\. Note that if X G -2^ then 

n 

E X [X}= Y[p(wk,&u h -i o---o$ Wl (a;))X(wi,--- 

(wi,- ,w„)G{0,l}" fc=l 

In particular, if there is /i : J — > C s.t. 

Jf (wi, ■ ■ • , w„) = o ■ ■ ■ o * Ul (x)) 

then 

E X [X] = (P n h)(x) (2.31) 
Now, having fixed x G J, define 
W (x,w):=x and W n (x, w) := & Xn (u) ° ■ ■ ■ ° n > 1 (2.32) 
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The process {W n (x, • ), n > 0} defined on (O, J-$ x ) is a Markov chain on 
J with initial state x and for any measurable function / : J — » C we have 



lim (P n /)(x) = lim ExI/CWnCx, • )] 



(2.33) 



Moreover, if /i : J — > C is a measurable bounded P-harmonic function then 
we have 



E x [/l(W n+ i(x, -))l^n] 



^ p{uj n+1 ,x)h(^ i , n+1 o---o$ Wl (x)) 

w n+ i£{0,l} 

o...o* Wl (z)) 
fc(W n (x, • )) 



In other words the sequence of random variables {h(W n (x, •)), n > 0} on 
(ri,^-",ffa;) is a bounded martingale (relative to the filtration {Z n ,n > 1}) 
and therefore it converges pointwise P x -a.e. The limit random variable 
H(x, • ) = lim^^oo h(W n (x, ■ )) satisfies 



where a : 
measurable function H : J xSl 



H(x,u) = F($ a , 1 (x),(7a;) 
$7 is the left shift acting as (o~uj) 



(2.34) 



A bounded 



C satisfying (2.34) is said to be a cocycle. 



Conversely, by (2.31) h may be recovered from the cocycle H as 

h{x) =E x [H(x, •)] 



(2.35) 



Remark 2.14. Note that P : L°°(J) — ► L°°(J) /ias norm one. Therefore 
if h £ L°°(J) is an eigenfuction of P corresponding to a real and positive 
eigenvalue then the sequence {h(W n (x, ■ )), n > 0} on (O, J^^x) is a super- 
martingale, which again converges P^-a.e. to a limit cocycle H. 



Remark 2.15. As pointed out in [Jo], p. 50, eq. (2.35) can be thought of as 



an analogue of the classical result about the existence of boundary functions 
for bounded harmonic functions via Poisson integral. 

We now discuss two specific Markov chains of the above type, d enote d MC' ' 
and MC^\ corresponding to the choices q = and q = 1 in (2.17). 
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2.3.1 The Markov chain MC^ 



Setting q = in (2.17) we have ^LqI = 1. One can then consider the Markov 
(i.e. normalised) operator P^ acting as p(0)/ =\L Q f. More esplicitly, 



i 



/ 



l + x 



(2.36) 



Note that if h is P^-harmonic then, iterating (2.36) we get 
N-i 

2 fe+i -\ x + k + l 



h{x) 



N-l - 

Ofc+l 



fc=0 



+ — % + AT) 



We therefore have the 



Lemma 2.16. A bounded function h : J 



fc=0 



X + fc + 1 



zs P^ -harmonic if and only 
, x G [0, oo) 



Moreover, 



Lemma 2.17. Let p 6e as m (1.2). The probability measure dp on J is 
p(°) -invariant. 

Proof. From the fact that the function p is the distribution function of the 
(permuted) Stern-Brocot fractions (cf. Theorem 1.7 ) and Lemma 1.9 one 
readily obtains that p satisfies the functional equation 



2p(x) 



if < x < 1 



1 — x / 
p(x-l) + l if x > 1 
The claim now follows straightforwardly. □ 



(2.37) 



Setting p(0,x) = p(l,x) = 1/2 we have that there are no compact invariant 
sets and according to ( |CoRalj . Sec. IV) ) h = 1 is the only bounded con- 
tinuous P^°)-harmonic function. Moreover, the unique probability measure 
]P^ on $7 such that 

P(°)(C(H,...,V>) = 2-" (2.38) 

i s atq mless for each x £ J. The Markov chain MC 1 ' ' is then defined as in 
(2.32) on the probability space (fi,^,ffl 0) ). We summarize the above in the 
following 
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fdp 



Theorem 2.18. For f £ L 1 (M + ,d / o) we have 

lim (P^ n f)(x) = lim E^[f(W n (x, ■ )] = 

n— >oo n— >oo 

Taking / = 1( Q ,6)) {&■, b) C M + , this is to be compared with Theorem 
2.3.2 The Markov chain MCW 



(2.39) 



1.12 



Setting q = 1 in (2.17) we have L\g = g where g{x) = 1/x is the G- 
invariant density. We then consider the Markov operator acting as 
pWf = g- 1 L 1 (f.g),oi 



(P (i) /)( 



x) = 

If h is P^^-harmonic then 

N-l 



f 



X + 1 V 1 + x 



+ 



1 + x 



f(x + l) 



(2.40) 



h(x) = ^ 



X 



h 



k=0 



x + k 



(x + k)(x + k + 1) \x + k + lj x + N 



+ 



h(x + N) 



Lemma 2.19. A bounded function h : J — > C is P^ -harmonic if and only 
if 



h(x) = J2 



h 



k=0 



(x + k)(x + k + 1) \x + k + l 



x + k 



x £ [0, oo) 



Furthermore, the validity of L\g = g is equivalent to the fact that the infinite 
measure v(dx) = cfcc/a; on J is P^-'-invariant. 
Set moreover 



p(0,oo)=p(l,0) =0 , p(0,0)=p(l,oo) = l 



and 



p(0,x) 



x + l 



p(l,x) 



x + l 



, x G (0, cxd) 



(2.41) 



(2.42) 



They plainly satisfy the symmetry (2.25). Moreover, from (2.41) it follows 
that the singletons {0} and {oo} are two disjoint compact invariant sets and 
from (2.42) one sees that they are the only invariant sets of this type. 



The Markov chain MC^ is now defined as in (2.32) on the probability 



space (f2, J-,W^), where is the transition measure on 17 arising from the 
probabilities (2.41) and (2.42). It satisfies the following 



>(1) 
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Lemma 2.20. For x € (0,oo) the measures W x have no atoms. On the 
other hand, both Pq 1 ^ and P^ are purely atomic with Pq 1 ^ = 5q°° a nd P^o = 



Proof. From (2.22) and (2.42) it follows that the path of length n starting 
at x € [l,oo) and having largest probability is that corresponding to the 
word cj = 1 • • • 1. If instead < x < 1 it corresponds to uj = • • • 0. On the 
other hand we have 



wW(c(i,...,i))=v[l(c(o,...,o)) = ll 



n-1 



x 



k=0 



x + k 



x + k + 1 x + n 



as n — > oo, proving the first assertion. The last is straigthforward. □ 

Now, a path starting somewhere in J and converging to corresponds to a 
sequence of the form (ux, . . . , u n , 0, 0, 0, . . . ) for some n > 1. The symmetric 
sequence (1 — u>i, . . . ,1 — u n ,l,l,l, . . .) yields a corresponding path which 
converges to oo. By Lemma 2.13, if we let the first path start at x and the 
second one at 1/x, all finite equal portions of them have the same probability. 
We can thus concentrate on the paths starting at x and converging to 0. 
In turn, these can be put in a one-to-one correspondence with Q2 via the 
mapping 



Q 2 3 a = LOi2 1 ^ u(a) = (ux, ■ ■ ■ , u n , 0, 0, 0, . . . ) 

i=l 

Another copy of Q2 is obtained via the mapping 

n 

1-Qa B 1-a = ^(1-^)2^+2-™ ^ w(l-a) = (1-oj 1: 
i=i 

With the identification a *-* uj{a) we set 



, l-w„, 1,1,1,... 



(2.43) 



so that Lemma 2.20 can be rephrased in the form 

P«(Q 2 )+P( 1 )(1-Q 2 ) = ^ + * 



■CO 
X 



(2.44) 



Finally, putting together the above and ( |CoRal] . Sec. IV) we get the 
following 
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Theorem 2.21. The space of bounded harmonic functions has dimension 
two. A basis for it is given by the functions 



ho(x) = 1 and h\(x) = w(x) + w(l/x) 



where 



w(x) 



>(l) 



lim W n (x, 







>(1) 

l/x 



lim W n (l/x, 

_n— >oo 



OO 



Remark 2.22. Having fixed x,y £ J let B(x,y) be the tail event 
B(x,y) := {to £ O : lim W n (x,u;) = y} 



According to (2.35), the cocycle H\ associated to h\ is 

Hx(x,u) = 1b( x ,0)UB(x,oo)( w ) 



Note that, eg. (2.44) can be further rephrased as 

Pi 1 ) [B(x, 0)] +P« [B(x, oo)] = S° x + <5- 
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